Controlling The Transition Glitch That Occurs When a Gateway 
Switches From Voice Over IP to Voice Band Data 

Field of the invention: 

5 The present invention relates to IP (Internet Protocol) networks and more 
particularly to the transmission of voice and data over IP networks. 

Background of the invention: 

Data is transmitted over the Internet in the form of packets. When a voice signal 
10 is transmitted over the internet the gateways operate in what is called "audio 
mode". In audio mode, the continuous voice signal is divided into packets, the 
packets are transmitted over the Internet, and at the receiving end, the packets 
are again assembled into a continuous voice signal. Since the time required for 
packets to travel over the internet is variable, the gateway at the receiving end 
15 includes a play-out buffer which receives and stores packets for a small interval. 
By storing received packets for a short interval, the play-out buffer provides a 
better opportunity for the system to assemble the packets in the correct order. 

Setting the length of the play-out buffer involves a tradeoff between setting the 
20 delay too long or too short. If the play-out buffer is too long, it will introduce too 
much latency which is annoying in a telephone conversation, on the other hand, if 
the length of the play-out buffer is too short too many packets will be missed. 
Generally the length of the play-out buffer is set adaptively such that the length of 
the buffer is dependent upon the characteristics of the network at each particular 
25 instant in time. There is a large amount of technical literature devoted to various 
techniques for making the length of the play-out buffer adaptive. 

When data from a modem or fax is transmitted over the Internet, the data is 
generally transmitted as Voice Band Data (VBD). When a gateway detects 
30 signals from a modem or fax, the gateway switches to VBD mode. 

When a gateway is operating in VBD mode, the play-out buffer is typically set to a 
relatively long value (for example, a value exceeding 100ms). The reason for this 
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is that VBD is more tolerant to delay than is voice data; however, VBD is less 
tolerant of missed packets than is voice data. 

When a modem (or fax) places a call to a second modem (or fax) the routers in 
5 the system detects the answer tone from the second modem (or fax) and the 
network gateways switch from audio mode to VBD mode. When this transitions 
occurs, the system switches from what may be a relatively short play-out buffer in 
audio mode to a relatively long play-out buffer in VBD mode. When the switch in 
buffer length occurs, there will be a gap in the answer tone which is received by 
10 the calling station. 

For example, consider what occurs when a system is operating in audio mode 
with a play-out buffer of 50ms and the system detects an answer tone and 
switches to a 200ms play-out buffer. At the receiving end there will be a gap in 

15 the signal of about 150ms. When the answering system encounters this gap it 
may assume that the answering system dropped off the line and it may drop off 
the line. It has been found that gaps even as small as 30 to 50ms can cause a 
significant drop in the call success rate of 10 to 20 percent with some client 
devices. It is noted that it may take a voice gateway in the nationhood of 250 ms 

20 to detect an answer tone. Thus, the gap caused by switching from audio mode to 
VBD mode will normally occur after the calling machine begins receiving the 
answer tone. It is noted that if the gap had been in the beginning part (the first 50 
to 100ms) of the answer tone, the gap would have little if any effect since the 
answering machine would not as yet have recognized the answer tone. 

25 

The problem addressed by the present invention, is the anomaly that occurs when 
a system switches from audio mode (with a short play-out buffer) to VBD mode 
(with a long play-out buffer). 

30 Summary of the Invention: 

With the present invention, the length of the play-out buffer in VBD mode is not set 
to a previously established value. Instead the length of the play-out buffer in VBD 
mode is determined by the length of the buffer in the previous audio mode. When 
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the system switches from audio mode to VDB mode, the final length of the play- 
out buffer in audio mode is noted. The length of the play-out buffer in VBD mode 
is then set equal to its length in the prior audio mode just before the transition 
occurred plus a dilation factor. An audio concealment algorithm (which fills in 
5 gaps caused by missed packets) is used during the transition from audio to VDB. 

Brief Description of the Figures: 

Figure 1 is an overall system diagram. 
Figure 2 is a diagram that shows the operations that occur. 
10 Figure 3 is a flow diagram showing the operation of the system. 

Detailed Description of Preferred Embodiments: 

The overall configuration of a system which embodies a first embodiment of the 
present invention is shown in Figure 1. The system shown includes a first client 
15 designated C1 and a second client designated C2. The clients C1 and C2 can be 
either fax machines or modems. 

The clients are connected through gateways G1 and G2 and network N. The 
gateways G1 and G2 include codecs CD1 and CD2, play-out buffers PB1 and 

20 PB2 and control programs CP1 and CP2. In normal operation, the clients receive 
information as follows: packets received from the network are assembled in the 
play-out buffers PB1 and PB2 and then passed to the codecs CD1 and CD2. The 
codecs change the digital information to analog information and pass the analog 
information to the clients. When clients are sending information the clients 

25 provide analog information to the codec, the codecs digitize the information and 
the gateway assembles the digital information into internet packets. The above 
described operations are the conventional manner in which gateways operate. 

The gateways G1 and G2 can be conventional, commercially available gateways 
30 such as those marketed by the Cisco Corporation under the family designation 
AS53xx, 26xx, or 37xx Alternatively, the gateways G1 and G2 could be various 
other commercially available gateways. The present invention can be 
implemented by modifying the control programming as explained below. 
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Figure 2 gives the sequence of operations which occur when client C1 places a 
call to client C2. The initial contact between the clients is over a normal audio or 
voice connection as indicated by block 201 . As indicated by block 202, client C2 
5 responds to the call with an answer tone. The gateways detect the answer tone 
as indicated by block 203 and they switch to VBD mode. What occurs relative to 
the play-out buffer when this occurs will be explained later. Under normal 
circumstances the client C1 recognizes the answer tone as indicated by block 204 
and brings up its modem as indicated by block 205. Data transmission then 
10 begins as indicated by block 206. 

When the system is operating in audio mode, the length of the play-out buffers is 
set in an adaptive manner depending upon the conditions of the system. The 
algorithm for setting the length of the play-out buffer, tries to balance the length of 
15 the buffer between the following two considerations: 

1 ) A long play-out buffer can minimize the number of packets that are 
missed; however, it can introduce unacceptable delay in telephone calls . 

2) A short play-out buffer can minimize call delay; however, it may cause 
too many packets to be missed. 

20 The length of the play-out buffer is generally set adaptively in audio mode to take 
into account the instantaneous condition of the network. There is a great deal of 
technical literature which teaches how to adaptively managing the length of a 
play-out buffer. In the embodiment described here, when the gateways G1 and 
G2 are in audio mode, they adaptively control the length of the play-out buffer in 

25 accordance with the prior art teachings. 

The problem addressed by the present invention is what occurs when the system 
switches between audio mode and VBD mode. If, for example, the system is in 
audio mode with a 50 ms play-out buffer and the system switches to a 200 ms 
30 buffer in VBD mode, the will be a gap of 1 50 ms in the signal received by the 

client. If this transition occurs while the client is receiving an answer tone, this can 
cause the receiving client to drop the call. 



c-318 idea #328623 



Page 4 (of 10 pages) 



ver B 



With the present invention, during the transition audio concealment is use to fill in 
for any missed packets. With the present invention when a system switches from 
audio mode to VDB mode, the steps shown in Figure 3 occurs. 



5 First, as indicated by block 301 , the system detects the length of the play-out 
buffer at the instant prior to the change over. Next as indicated by block 302 a 
dilation factor "D" is added to the detected amount. The play-out buffer is then set 
to the sum of the prior amount plus the value of the dilation factor as indicated by 
block 303. Any gaps which result due to this change in value are filled in using 

10 conventional audio concealment techniques as indicated by block 304. Since 

audio concealment techniques are applied, there will not be gaps in the amount of 
energy in the signal produced. There may be anomalies or "glitches" in the phase 
of the signal; however, these are normally benign to the client modems as long as 
transition to VBD occurs soon after the switch over (i.e., less than 400 ms since 

15 start of ANS signal). Finally data transmission begins as indicated by block 305. 

The magnitude of the dilation factor "D" should be large enough so as to not 
increase the possibility of missed packets during VBD mode. It should also be 
small enough, so any resulting glitch can be covered by the concealment 
20 algorithm. A dilation factor "D" of 50ms has been found to be satisfactory in many 
normal situations. 

It is noted that the clients shown in Figure 1 could be either fax machines or a 
modems which place calls to a second fax machine or modem. The gateways 
25 switch from audio mode to VDB data mode when the gateways detect the answer 
tone from the client being called. This occurs in a conventional manner. The 
present invention is only directed to a technique for setting the length of the play- 
out buffer when this transitions occurs. 

30 In summary, with the present invention, when a gateway switches from audio 
mode to VBD mode, the length of the play-out buffer is increased by an amount 
that is dependent on its value in the previous audio made. When the switch is 
made from audio mode to VBD mode, the length of the play-out buffer is set to a 
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value which is the sum of it previous length in the audio mode and a dilation 
factor. As a result, with the present invention the call success rate is increased. 



While the invention has been described with reference to a preferred embodiment 
5 thereof, it should be understood that various changes in form and detail can be 
made without departing from the spirit and scope of the invention. 



I claim: 
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